07. Dataset Schema Analysis

Dataset Schema Analysis

ND320 AIHCND C01 L01 A07 Dataset Schema Analysis

Key Things to Consider:

  • Identify the predictor
  • Identify categorical, numerical features
  • Work with SMEs and Domain Experts
  • Domain knowledge is key to representing data correctly

You can find this dataset in the notebook on this page to inspect the dataset as well as use the link below to get more specific information for the different categories.

UCI Heart Disease Dataset

Code

If you need a code on the https://github.com/udacity.

Data Schema Quiz 1

QUIZ QUESTION::

Match what is True or False when analyzing the schema of a dataset?

ANSWER CHOICES:



Statement

True/False

Identify correlations between features

Domain expertise is not needed if you utilize recursive feature elimination.

Identify the predictor attribute/feature.

Identify possible categorical and numerical features.

SOLUTION:

Statement

True/False

Identify the predictor attribute/feature.

Identify possible categorical and numerical features.

Identify correlations between features

Domain expertise is not needed if you utilize recursive feature elimination.

Identify correlations between features

Domain expertise is not needed if you utilize recursive feature elimination.

Identify the predictor attribute/feature.

Identify possible categorical and numerical features.

Identify the predictor attribute/feature.

Identify possible categorical and numerical features.

Identify correlations between features

Domain expertise is not needed if you utilize recursive feature elimination.

Data Schema Quiz

QUIZ QUESTION::

Part - 1: Which features are categorical and which are numerical? Please use the Notebook workspace above and the UCI Heart Disease Dataset site to help you with this quiz.

ANSWER CHOICES:



Feature

Type

slope

thalach

thal

cp

trestbps

SOLUTION:

Feature

Type

slope

thal

cp

thalach

trestbps

thalach

trestbps

slope

thal

cp

slope

thal

cp

Data Schema Quiz

QUIZ QUESTION::

Part - 2: Which features are categorical and which are numerical? Please use the Notebook workspace above and the UCI Heart Disease Dataset site to help you with this quiz.

ANSWER CHOICES:



Feature

Type

fbs

ca

oldpeak

sex

SOLUTION:

Feature

Type

fbs

ca

sex

fbs

ca

sex

fbs

ca

sex

oldpeak